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ABSTRACT 

Contextual classifiers are being de- 
veloped as a method to exploit the spati- 
al/spectral context of a pixel to achieve 
accurate classification. Classification 
algorithms such as the contextual classi- 
fier typically require large amounts of 
computation time. One way to reduce the 
execution time of these tasks is through 
the use of parallelism. The applicability 
of the CDC Flexible Processor system and 
of a proposed multimicroprocessor system 
(PASM) for implementing contextual clas- 
sifiers is examined. 

I. INTRODUCTION 

Contextual classifiers are being de- 
veloped as a method to exploit the spati- 
al/spectral cortext of a pixel to achieve 
accurate classification. Just as in writ- 
ten English one can expect to find certain 
letters occurring regularly in particular 
arrangements with other letters (qu, ee, 
est, tion) , so certain classes of ground 
cover are likely to occur in the "context" 
of others. The former phenomenon has been 
used to improve character recognition ac- 
curacy in text-reading machines. Ke have 
demonstrated that the latter can be used 
to improve accuracy in classifying remote 
sensing data [1-3]. Intuitively this 
should not be surprising since one can 
easily think of ground cover classes more 
likely to occur in some contexts than in 
others. One does not expect to find 
wheat growing in the midst of a housing 
subdivision, for example. A close-grown, 
lush vegetative cover in such a location 
is more likely the turf of a lawn. 

Classification algorithms such as the 
contextual clascj.fj.er (and even much sim- 
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pier algorithms used for remote sensing 
data analysis) typically require large a- 
mounts of computation time. One way to 
reduce the execution time of these tasks 
is through the use of parallelism. Vari- 
ous parallel processing systems that can 
be used for remote sensing. have been 
built or proposed. The Control Data Cor- 
poration Flexible Processor system is a 
commercially available multiprocessor sys- 
tem which has been recommended for use in 
remote sensing [4,5]. PASM is a proposed 
multimicroprocessor for image processing 
and pattern recognition [6], 

Section II briefly describes the con- 
text classifier and gives an algorithm for 
performing it. The use of the Flexible 
Processor system to implement the classi- 
fier is explored in Section III. The use 
of PASM to implement the classifier is 
discussed in Section IV. 


II. THE CONTEXTUAL CLASSIFIER 


The image data to be classified are 
assumed to be a two-dimensional I-bv-J 
array of multivariate pixels. Associated 
with the pixel at "row i" and "column j" 
is the multivariate measurement n- vector 
X. . c R n and the true class of the pixel 


C i j e ^ = i > • • • > “ c ' • 

have class-conditional 
f(Xju> k ) , k = 1 , 2 , ... ,C, 


The measurements 
densities 

and are assumed to 


be class-conditionally independent. The 
objective is to classify the pixels in the 
array. 


In order to incorporate contextual 
information into the classification pro- 
cess, when each pixel is to be classified 
p-1 of its neighbors are also examined. 
This neighborhood,, including the pixel to 
be classified, will be referred to as the 
p-array. Intuitively, to classify each 
pixel, the contextual classifier computes 
the probability o:f the given observed 
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pixel being in class k by also considering 
the Measurement vectors (values) observed 
for the neighbor pixels in the p-array. 
Specifically, for each pixel, for each 
class in ft, a discriminant function g is 
calculated. The pixel is assigned to the 
class for which g is greatest. Each value 
of g is computed by summing the weighted 
probabilities of the p-1 neighbor pixels 
occurring in all possible classification 
states. This is described below mathema- 
tically for pixel (i,j) being in class 

The description is followed by an example 
to clarify the notation used. Further de- 


where 

X £ eX. . is the measurement vector from the 
- 1 fcth pixel in the p-array (for pixel 
( i , j ) ) 

0 t c — ii * s the class the ith pixel in the 
3 p-array (for pixel ( i , j ) ) 

f (X £ |0 t ) is the class-conditional density 
of X £ given that the ith pixel is 
front class 

G p (0..) = G p (6 , , 0, , . . . , 9. ) is the a priori 
probability of observing the p-array 



Within the p-array, the pixel locations 
may be numbered in any convenient but fix- 
ed order. The joint probability distribu- 
tion g? is referred to as the context 
distribution. 


= f (X^ | b) f (X 2 | a) f (X 3 |a)G(b,a,a) 

+ f (X 1 |b)f (X 2 |a) f (X 3 |b)G(b-,a,b) 

+ f (XjJb)f (X 2 | b) f (X 3 |a)G(b,b,a) 

+ f (X x |b) f (X 2 |b)f (X 3 |b)G(b,b,b) 

Note that G^ (£ i j) = G(0 1( e 2 ,e 3 ) is the 

relative frequency of occurrence in the 
scene of the specific neighborhood confi- 
guration ( e x , e 2 , e 3 > • 

After computing the discriminant functions 
g a and g fa for pixel (i,j), pixel (i,j) is 

assigned to the class which has the larger 
discriminant function value. 

Algorithm 1, shown in Figure XI. 2, is 
one way to implement the contextual clas- 
sifier. The particular classifier consid- 
ered here uses a horizontally linear 
p-array of size three. This is shown in 
Figure II. 1. 


First consider the main loop. Lot 
the original image to be classified be an 
I-by-J array called A. Columns 0 and J-l, 
the two side edges of the image, are not 
classified since these pixels will not 
have both right and left neighbors.. The 
variable "value" will contain the maximum 
"g” (discriminant function) value calcu- 
lated for pixel li,j). This variable may 
be updated as the "g" for each class is 
calculated. The variable "Class” is the 
class associated with "value." In the 
main loop, “g(i,jj,k)" is a call to a 
function to calculate the discriminant 
function for pixel (i,j) and class k.- 
This function is called I * (J-2) * C 

times, once for each class for each pixel 
being classified. 


tails may be found in [1,2,7]. 
T P 


g k ( £ij )= 


I 


2ij « 

6 ij “ “k 


ft f(x £ ja £ ) 
i=l 


To clarify the computation of the dis- 
criminant function, consider the following 
example. Let the context array (neighbor- 
hood) be the p= > choice shown in Figure 
II. 1 with the pixels numbered such that 
the ; ixel (i,j) to be classified is asso- 
ciated with Xj and 0 £ , pixel ( i , j — 1 ) is 

associated with X^ and and pixel 
(i,j+l) is associated with X^ and 0 3 . 

Assume there are two possible classes: ft = 
{a,b}. Then the discriminant function for 
class b is explicitly 



Consider the calculation of g(i,j,k). 
The class of pixel (i,j) is held constant 
at k, while all other possible class 
assignments are considered for pixels 
( i , j — 1 ) and (i.j+l). For each assignment 
of classes for tihe pixels neighboring 
pixel (i,j), of which there are C*C, the 
product of the class-conditional densi- 
ties ("contpf") is weighted by "G(r,k,ql," 
the a priori probability of observing the 
3-array (o r ,u^,Wg>. The "G" array is pre- 
determined and prestored. For each call 
"g(i,j,k),“ the value of "sum" for that 
i,j, and k is calculated. "Sum" is then 
returned as the 'value of "g(i,j,k).“ In 
this straightforward version of the 
g(i,j,k) routine,, the function to compute 
a class-conditiooial density ("compf") is 
called C*C times each time "a" is called. 


/ 


/ 
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Now consider the "compf" routine. 
This calculates the class-conditional 
density for pixel (a,b) and class k using 
the following equation: 


f(x|k)=e l09|5:k f " i (X "^ >T £ k 1 (X - m k ) ] /2 


where the measurement vector for each 
pixel is of size four, is the inverse 

covariance matrix for class k (four-by- 
four matrix) , m^ is the mean vector for 
class k (size four vector) , "T" indicates 
the transpose, and "log" is the natural 
logarithm. For each class., the algorithm 

uses log|f^ |,£^, and m^ as precomputed 

constants. For each call "compf (a,b,k), n 
the value of » e ex P°” for that a,b, and k 

is calculated. " e ex P°" i s then returned 
as the value of "compf (a ,b,k) . " 


Algorithm 1 executes the "compf" sub- 
routine I*(J-2)*C 2 times. Since for each 
pixel there are C "f"s (class-conditional 
densities), this approach is inefficient 
by a factor of c 2 - Algorithm 2 rectifies 
this problem by saving certain "f" values 
rather than recalculating them. 

The Algorithm 2, shown in Figure XI. 3, 
implements the contextual classifier with- 
out the redundant executions of "compf" 
that occur in Algorithm 1. Let X, Y, and 
Z correspond to the pixels (i, j— 1) , (i,j), 

and (i,j+l), respectively, where (i,j) is 
the pixel to be classified. Each of X, Y, 
and Z is a vector of size C. Element t 
of X will contain the class-conditional 
density ("compf") for the current ( i , j — 1 ) 
pixel for class t. Y and Z are defined 
similarly. By using these three vectors 
to save the class-conditional densities, 
each density (for a given pixel and class) 
i| calculated only once, instead of 
C times. 

The main loop of Algorithm 2 is modi- 
fied to calculate the class-conditional 
densities for the first three columns each 
time a new row is considered (i.e., each 
time "i" is incremented) . Each time a new 
pixel in a given row is to be classified 
(i.e., just before "j" is incremented), 
these values are updated. In particular, 

X gets the Y values, Y gets the Z values, 
end new values are calculated to update Z. 

The new discriminant function calcu- 
lation, g 1 , does not call the subroutine 


"compf." It gets the values it needs from 
the X, Y, and Z arrays. For each call 
"g* (k)," the value of "sum" for that k is 
calculated. "Sum" is then returned as the 
value of "g* (k) . " 

The same, "compf" routine is used for 
both Algorithms 1 and 2. Algorithm 1 
calls this routine I* (J-2) *c3 times, while 
Algorithm 2 calls it only I*(J-2)*C times. 


There are other techniques that can be 
employed to make Algorithm 2 even more ef- 
ficient that have not been included in or- 
der to avoid obscuring the basic program 
flow. 

The serial complexity of Algorithm 2 
can be calculated in terms of assignment 
statements, multiplications, additions, 
and "compf” calculations. To initialize 
X, Y, and Z for new rows, I*C*3 assign- 
ments and calls to "compf" occur. For 
each pixel, at most C+l assignments to 
"value" and "class" occur, C assignments 
to "current" occur, and C calls to 

"g* (k) " occur. In addition, for each row, 
the X, Y, and Z vectors are updated J-3 
times, each update using 3*C assignments 
and C calls to "compf." Each execution of 

2 2 
"g’(k)" uses 3*C multiplications, C ad- 
ditions, and C 2 +l assignments. Thus, the 
total complexity for Algorithm 2 is.: 

I(J'C 3 +7C+2)-(2C 3 +14C+4) assignments; 

3C 3 I i J-2) multiplications; 

C 3 1 ( J - 2 ) additions; and 

I*J*C "compf" 

calculations. 

The growth is proportional to 

I*J*C 3 assignments, multiplications and 
additions, and I*J*C "compf" calculations. 

In this section, a contextual clas- 
sifier based on a horizontally linear 
neighborhood of size three has been ana- 
lyzed. Algorithms for contextual clas- 
sifiers using other size and shape neigh- 
borhoods would be analogous to the algo- 
rithms which were presented. 


Algorithms 1 and 2 are written for 
conventional uniprocessor systems. Sec- 
tions III and IV will examine how to im- 
plement Algorithm 2 on a CDC Flexible 
Processor system and on a multimicropro- 
cessor system such as PASM. 
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III. FLEXIBLE PROCESSOR SYSTEM IMPLEMEN- 
TATION OF THE CONTEXTUAL CLASSIFIER 

This section discusses programming a 
CDC Flexiblo Processor system [4] simula- 
tor to perform a size three linear neigh- 
borhood contextual classifier. The Flexi- 
ble Processor system is briefly overviewed. 
Then the simulation is described. 

The basic components of a Flexible 
Processor (FP) are shown in Figure III.l. 
Each FP is microprogrammed, permitting pa- 
rallel ism at the instruction level. An 
example of the way in which N FPs may be 
configured into a system is shown in Fi- 
gure I II . 2. There can he up to 16 FPs 
linked together, providing much parallel- 
ism at the processor level. The FPs can 
communicate among themselves through the 
high-speed ring or shared bulk memory. 

The clock c. cle time of each FP is 125 
nsec (nanoseconds) . Since 16 FPs can be 
connected in a parallel and/or pipelined 
fashion, the effective throughput can be 
drastically increased, resulting in a po- 
tential effective cycle time of less than 
10 nsec. 

An FP is programmed in micro-assembly 
language, allowing parallelism at the in- 
struction level. For example, it is pos- 
sible to conditionally increment an index 
register, do a program jump, multiply two 
8-bit integers, and add two 32-bit inte- 
gers -- all simultaneously. This type of 
operational overlap, in conjunction with 
the multiprocessing capability of the FPs, 
greatly increases the speed of the FP 
array . 

The following list summarises the im- 
portant architectural features of an FP: 

User microprogrammable. 

Dual 16-bit internal bus system. 

Able to operate with either 16- or 
32-bit words. 

125 nsec clock cycle. 

125 nsec time to add two 32-bit 
integers. 

250 nsec time to multiply two 8-bit 
integers. 

Register file (with 60 nsec access 
time) of over 8,000 16-bit words. 


In order to debug, verify, and time 
FP algorithms, a simulator for an array of 
up to 16 FPs has been developed. This 
simulator runs under the UNIX operating 
system on a PDP-11 series computer at 
LARS and has been used to program a maxi- 
mum likelihood classifier [1], An assem- 
bler tor the micro-assembly language has 
also boon developed. 


The experience gained through the use 
of the simulator has made evident the fol- 
lowing advantages and disadvantages of the 
system. 

Advantages: 

Multiple processors (up to 16) . 

User microprogrammable — parallelism 
at the instruction level. 

Connection ring for inter-FP communi- 
cations. 

Shared bulk memory units. 

Separate arithmetic logic unit and 
hardware multiply. 

Disadvantages: 

No floating-point hardware. 

Micro-assembly language — difficult 
to program. 

Program memory limited to 4k micro- 
instructions. 

More details about the FP may be 
found in [8]. Information about the assem- 
bler and simulator used at LARS to assem- 
ble and execute the FP programs for the 
contextual classifier is presented in [7], 

Consider the implementation of a con- 
textual classifier on an array of N FPs. 
Assume the neighborhood is, hori zontally 
linear, as shown in Figure 111.3. Divide" 
the A-by-B image into subimages of B/N 
rows A pixels long, as shown in Figure 
III. 4. Assign each subimage to a differ- 
ent FP. The entire neighborhood of each 
pixel is included in its subimage. Each 
FP can ‘.here fore execute the uniprocessor 
algoritnm presented in Section II on its 
own subimage. No interaction between FPs 
is needed, i.e., each FP can process its 
subimage independently. 

The LARS FP microassembler ar.d simu- 
lator are being used to gather statistics 
on the execution time for the size three 
horizontally linear neighborhood contex- 
tual classifier. Due to the fact that 
each FP is microprogrammable , determining 
program correctness and analyzing execu- 
tion times is done through the use of the 
microassembler and simulator. The current 
implementation of the contextual classi- 
fier U3cs 744 microinstructions, stored 
in the microniemory (see Figure III.l). 

The format of the data words of the pixel 
measurement vectors, covariance matrices, 
etc., consists of a 14-bit two's comple- 
ment exponent and a 17-bit sign-magnitude 
mantissa. The covariance matrices, loga- 
rithms of the determinants of the covari- 
ance matrices, a priori probabilities (CP), 
and the X, Y, and Z vectors are all stored 
in the large file (see Figure III.l). In 
this way, each FP has all Uic information 
it needs for per forming the class: : icat ion 
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on its subimage, The subimage data itself 
would be stored in a bulk memory (see Fi- 
gure III, 2). A multiple FP configuration 
which associates one bulk memory with each 
FP would be best for this application. For 
testing the FP contextual classifier pro- 
gram, the classification of one row of 
eight pixel measurement vectors (stored in 
the large file) using four classes is 
being evaluated. The FP contextual clas- 
sifier program is currently being debugged. 
The timing results of using the FP simula- 
tor to classify actual data using Algorithm 
2 (Figure II. 3) will be presented at the 
symposium.. 

For the horizontally linear neighbor- 
hoods, when using N FPs together to pro- 
cess an image, each FP handles 1/N-th of 
the image. Therefore, nearly a factor of 
N improvement is attained over the time 
required for one FP to implement the con- 
textual classifier. (A perfect factor of 
N improvement occurs if B is a multiple 
of N. The minor degradation in perfor- 
mance when B is not a multiple of N is 
discussed in [2].) Vertically linear and 
diagonally linear neighborhoods (Figure 
III. 5) can be processed in a manner simi- 
lar to that for horizontally linear neigh- 
borhoods [2]. 

Consider nonlinear neighborhoods, 
that is, neighborhoods which do not fit 
into one of the linear classes. For exam- 
ple, all of the neighborhoods in Figure 
III. 6 are nonlinear. It can be shown that 
there is no way to partition an image into 
N (not necessarily equal) sections such 
that a contextual classifier using a non- 
linear neighborhood can be performed with- 
out data transfers among FPs [2]. The way 
in which to assign pixels to FPs in order 
to minimize computation time will depend 
upon the particular image size, number of 
FPs used, the time required for inter-FP 
communications, and the shape and size of 
the neighborhood. A detailed analysis of 
the interaction of these factors is cur- 
rently under study. 


IV. MULTIMICROPROCESSOR IMPLEMENTATION 
OF TilE CONTEXTUAL CLASSIFIER 

This section describes a method for 
implementing the contextual classifier on 
a large-scale multimicroprocessor system 
such as PASM [6,9-11]. PASM is a dynami- 
cally reccnf igurable system being designed 
at Purdue University for image processing 
and pattern recognition tasks. The PASM 
design will support up to 1024 processors. 


Other computer architects have proposed 
parallel processing systems with 2^ 4 to 
2*® microprocessors [12,13]. The method 
for implementing the contextual classifier 
on PASM will be based on the use of the 
SIMD mode of parallelism. 

The acrorym SIMD stands for "single 
instruction stream — multiple data stream" 
[14]. Typically, an SIMD machine is a com- 
puter system consisting of a control unit, 

N processors, N memory modules, and an in- 
terconnection network. The control unit 
broadcasts instructions to all of the pro- 
cessors, and all active processors execute 
the same instruction at the same time. 

Thus, there is a single instruction stream. 
Each active processor executes the instruc- 
tion on data in its own associated memory 
module. Thus, there is a multiple data 
stream. The interconnection network, some- 
times referred to as an alignment or permu- 
tation network, provides a communications 
facility for the processors and memory 
modules. Examples of existing SIMD ma- 
chines include the Illiac IV and STARAN 
[15,16]. 

One way to model the physical struc- 
ture of an SIMD machine is shown in Figure 
IV. 1. As indicated, there are N proces- 
sing elements (PEs) where each PE consists 
of a processor with its own memory. The 
PEs receive their instructions from the 
control unit and communicate through the 
interconnection network. 

To demonstrate how SIMD machines ope- 
rate, consider the following simple task. 
Assume that A, B, and C are each one-dimen- 
sional arrays (vectors) and that the task 
to be performed is the elementwise addi- 
tion of A and B, storing the result in C. 

In a uniprocessor system, this can be ex- 
pressed as: 

for i * 0 to N-l do 

C (i) = A(i) + B ( i) 

This computation will take N steps on a 
serial machine. 

Assume that A, B, and C are stored in 
a SIMD machine, with N PEs, such that A(i) , 
B(i), and C(i) are all stored in the memory 
of PE i, 0 < i < N. To perform an element- 
wise addition of the vectors A and B and 
store the result in C, all PEs would exe- 
cute (simultaneously) 

C = A + B 

with PE i doing the addition of A(i) and 
B ( i ) , storing the result in C(i). Thus, 
in this case, the SIMD machine does in one- 
step a task requiring N steps on a serial 
processor. 
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Consider a variation on this example. 
Assume the N-step serial task is: 

for i ■ 1 to N-l do 

C(i) » A (i ) + B(i-l) 

C(0) » A (0) 

Given the data allocation above (i.e., A(i), 
B(i), and C(i) ir. PE(i)), an SIMD machine 
does this task in three different steps: 

1. The value of B(i-l) is moved, 
through the interconnection network, from 
PE i-1 to PE i, 1 < i < n. Most proposed 
and existing SIMD Interconnection networks 
can do this in one parallel data transfer 

[17]. 

2. In PE i, add A(i) to B(i-l) and 
store the result in C(i), 1 _< i < N (PE 0 
is disabled) . 

3. In PE 0, store A(0) in C (0) (all 
other PEs are disabled). 

Thus, this example demonstrates the 
need for the interconnection network and 
methods for disabling PEs. 

This simple example was provided to 
familiarize the reader with the concept of 
the SIMD mode of parallel processing. More 
complex examples involving image process- 
ing and feature extraction can be found in 
[18,19]. 

Consider the implementation of the 
contextual classifier discussed in Sections 
II and III on a microprocessor-based SIMD 
machine. Recall that the neighborhood is 
as shown in Figure II. 1, i.e., a horizon- 
tally linear neighborhood with p=3. The 
approach to decomposing the task will be 
similar to that used in Section III for 
the FP system. In both cases, the image 
is divided into N subimages, and each sub- 
image is assigned to a different processor 
for classification computations. However, 
there are three main differences: 


1. It is technologically and economi- 
cally feasible to construct a multimicro- 
processor SIMD machine with many more than 
16 processors. Therefore, while the "N" 
for the FP system is limited by 16, the 
"N" for the multimicroprocessor system 
could be as large as 256, 512, or 1024. 

2. The differences in computational 
capabilities between an FP and an off-the- 
shelf microprocessor must be considered. 

For example, depending on the microproces- 
sor chosen, 16 FPs may be faster than 32 
microprocessors. 


3. In the SIMD mode of parallelism, 
the program (Algorithm 2) is stored in the 
control unit, not in each microprocessor. 
The control unit broadcasts the instruc- 
tions to the microprocessors. The control 
unit would also store the g p array, broad- 
casting the appropriate array element to 
all the microprocessors when it is needed. 
In the FP system, each FP would store a 
copy of the program and must store or have 
access to the g p array. 


Thus, a 3IMD machine can be used to 
perform the contextual classification 
based on a horizontally linear neighbor- 
hood of size three without any inter-PE 
communication. As in the case of using 
the FP system to implement the classifier, 
the implementation using an SIMD machine 
with N microprocessors can achieve as much 
as a factor of N improvement over the use 
of a single microprocessor. The exact 
improvement will be a function of the image 
size and N. 

To attain a perfect factor of N im- 
provement, B (in Figure I II. 4) would have 
to be a multiple of N . Since N in the 
SIMD case would be a multiple of the N in 
the FP case, this is less likely to occur. 
When B is not a multiple of N, then (a) 
some PEs may have to process more rows 
than others (leaving some PF.s underutil izr- 
ed) , or (b) each PE would process a sub- 
image including a partial row (requiring 
inter-PE data transfers). The alternative 
which is best would depend on the image 
size, the way in which subimages are allo- 
cated to PEs, N, the processor speed, and 
the interconnection network speed. The 
situation for vertically linear and dia- 
gonally linear neighborhoods is similar. 
Nonlinear neighborhoods require inter-PE 
communications, but the best way to imple- 
ment such a classifier would depend on the 
factors just mentioned and the neighborhood 
size and shape. These implementation con- 
siderations are currently being explored. 


V. CONCLUSIONS 

Algorithms for performing contextual 
classifications using a size three horizon- 
tally linear neighborhood were presented. 
Algorithm 1 was a straightforward approach. 
Algorithm 2 was a more efficient approach 
that avoided redundant calculations. The 
serial computational complexity of Algo- 
rithm 2 was shown to have a growth propor- 
tional to I*J*C assignments, multiplica- 
tions, and additions, and I*J*C "ccr.p:"" 
calculations. The way in which N ITs 
could, perform the classifications N times 
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faster than a single FP was explained. 

The use of N microprocessors in the SIMD 
mode of parallel processing to do the 
classifications N times faster chan a 
single microprocessor was discussed. 

In summary, contextual classifiers 
have been shown to be powerful remote 
sensing tools in other papers. Their main 
disadvantage is their computation complex- 
ity. This paper has demonstrated how 
parallel processing can be used to over- 
come this disadvantage. 
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Main Loot 


Discriminant Function Calculation 


for i = 0 to 1-1 do /* row */ 
begin 

for j = 1 to J-2 do /* column V 


begi n /* for each pixel */ 

value = -1 /* max "g" V 

class = -1 /* class with max 

"g" V 

for k = 1 to C'^o /* for each class */ 


current = g(i,j,k) 
if current > value 


function 


sum = 0 


g(i, j,k) 


for r 1 1 to C do /* all possible 

classes */ 

begin 

for q = 1 to C do /* all possible 

classes */ 

begin 

sum = compf (i, j-1, r) ‘corr.pf (i, j ,k) 
*compf (i , j+1 , q) *G (r , k,q) +sum 


then value = current 
class ^ k 


print Pixel (i,j) is classified as 
"class” 


return (sum) 


Class- Conditional Density Calculation 

function compf(a,b,k) /* for pixel (a,b), 

class k */ 

x = A(a,b) /* x is pixel measurement 

vector */ 

expo = logl-fc 1“ [l x-n x ,T (x ' rr 'k ) ] * 

. , expo, 

return (e ) 


Figure II. 2. Algorithm i --Implementa- 
tion of a contextual classifier. 

Main Loop. 


Figure II. 2 (cont.). Algorithm 1 
Discriminant function and class- 
conditional density routines. 
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Main Loop 

for i = 0 to 1-1 do /* row */ 
begin 

for k = 1 to C do 

begin /* compute f's for 1st 3 

columns */ 

X(k) = compf (i,0,k) 

Y(k) = compf (i,l,k) 

Z(k) = 'compf (i,2,k) 

end 

for j = 1 to J-2 do /* column */ 
begin /* for each pixel */ 

value = -1 /* max "g" */ 

class = -1 /* class with max "g” */ 

for k = 1 to C do 
begin 

current = g 1 (k) 
if current > value 

then value = current 
class = k 

end 

print Pixel (i,j) is classified as 

"class " 

if j < J-2 

then /* update X,Y,7, arrays */ 
for k = 1 to C do 
begin 

X(k) = Y (k) 

Y (k) = Z(k) 

Z(k) = compf (i,j+2,k) 
end 

end 

end 


Figure H.3. "Algorithm 2 — Implemen- 
tation of a contextual classifier. 
Main Loop. 


Discriminant Function Calculation 
function g* (k) 
sum = 0 

for r = 1 to C do /* all possible 

classes */ 

begin 

for q = 1 to C do /* all possible 

classes */ 

begin 

sum ** X(r) * Y vk) * Z(q) 

*G(r,k,q) + sum 


end 

end 

return (sum) 


Figure IT. 3 (cont.). Algorithm 2 

Discriminant function calculation. 





(i, j-X) 

(i, j) 

<i,j+l) 


Figure II.l. A p=3 context array 
(neighborhood) . 



tt it 


Figure l'll.l. Data path organization 
in the CDC Flexible Processor. 
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